Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 222500 |
| Missing cells | 234318 |
| Missing cells (%) | 6.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 13.8 MiB |
| Average record size in memory | 65.0 B |
Variable types
| NUM | 8 |
|---|---|
| CAT | 6 |
| BOOL | 2 |
Reproduction
| Analysis started | 2020-03-26 17:53:19.113986 |
|---|---|
| Analysis finished | 2020-03-26 19:22:23.734206 |
| Version | pandas-profiling v2.5.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Census_OEMNameIdentifier has 2243 (1.0%) missing values | Missing |
Census_OEMModelIdentifier has 2401 (1.1%) missing values | Missing |
Census_ProcessorClass has 221498 (99.5%) missing values | Missing |
| Distinct count | 222500 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 223242.75157752808 |
|---|---|
| Minimum | 3 |
| Maximum | 446244 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 22424.95 |
| Q1 | 111764.5 |
| median | 223196.5 |
| Q3 | 334939.25 |
| 95-th percentile | 423972.1 |
| Maximum | 446244 |
| Range | 446241 |
| Interquartile range (IQR) | 223174.75 |
Descriptive statistics
| Standard deviation | 128837.2552 |
|---|---|
| Coefficient of variation (CV) | 0.577117305 |
| Kurtosis | -1.20097224 |
| Mean | 223242.7516 |
| Median Absolute Deviation (MAD) | 111586.7464 |
| Skewness | 0.0003429752968 |
| Sum | 4.967151223e+10 |
| Variance | 1.659903832e+10 |
| Value | Count | Frequency (%) | |
| 4094 | 1 | < 0.1% | |
| 204086 | 1 | < 0.1% | |
| 62795 | 1 | < 0.1% | |
| 58697 | 1 | < 0.1% | |
| 60744 | 1 | < 0.1% | |
| 36164 | 1 | < 0.1% | |
| 46403 | 1 | < 0.1% | |
| 48450 | 1 | < 0.1% | |
| 42305 | 1 | < 0.1% | |
| 44352 | 1 | < 0.1% | |
| Other values (222490) | 222490 | > 99.9% |
| Value | Count | Frequency (%) | |
| 3 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 446244 | 1 | < 0.1% | |
| 446241 | 1 | < 0.1% | |
| 446235 | 1 | < 0.1% | |
| 446231 | 1 | < 0.1% | |
| 446224 | 1 | < 0.1% |
Census_MDC2FormFactor
Categorical
| Distinct count | 12 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 217.8 KiB |
| Notebook | |
|---|---|
| Desktop | |
| Convertible | 10173 |
| AllInOne | 7410 |
| Detachable | 5698 |
| Other values (7) | 4751 |
| Value | Count | Frequency (%) | |
| Notebook | 144007 | 64.7% | |
| Desktop | 50461 | 22.7% | |
| Convertible | 10173 | 4.6% | |
| AllInOne | 7410 | 3.3% | |
| Detachable | 5698 | 2.6% | |
| PCOther | 3278 | 1.5% | |
| LargeTablet | 1012 | 0.5% | |
| SmallTablet | 264 | 0.1% | |
| SmallServer | 131 | 0.1% | |
| MediumServer | 47 | < 0.1% | |
| Other values (2) | 19 | < 0.1% |
Length
| Max length | 12 |
|---|---|
| Mean length | 7.966930337 |
| Min length | 7 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 19 | 63.3% | |
| Uppercase_Letter | 11 | 36.7% |
| Value | Count | Frequency (%) | |
| Latin | 30 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 30 | 100.0% |
Census_DeviceFamily
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 217.5 KiB |
| Windows.Desktop | |
|---|---|
| Windows.Server | 241 |
| Windows | 1 |
| Value | Count | Frequency (%) | |
| Windows.Desktop | 222258 | 99.9% | |
| Windows.Server | 241 | 0.1% | |
| Windows | 1 | < 0.1% |
Length
| Max length | 15 |
|---|---|
| Mean length | 14.9988809 |
| Min length | 7 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 12 | 75.0% | |
| Uppercase_Letter | 3 | 18.8% | |
| Other_Punctuation | 1 | 6.2% |
| Value | Count | Frequency (%) | |
| Latin | 15 | 93.8% | |
| Common | 1 | 6.2% |
| Value | Count | Frequency (%) | |
| ASCII | 16 | 100.0% |
| Distinct count | 1009 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 2243 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2197.5884 |
|---|---|
| Minimum | 74.0 |
| Maximum | 6143.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 869.3 KiB |
Quantile statistics
| Minimum | 74 |
|---|---|
| 5-th percentile | 525 |
| Q1 | 1443 |
| median | 2102 |
| Q3 | 2668 |
| 95-th percentile | 4730 |
| Maximum | 6143 |
| Range | 6069 |
| Interquartile range (IQR) | 1225 |
Descriptive statistics
| Standard deviation | 1298.672241 |
|---|---|
| Coefficient of variation (CV) | 0.5909533501 |
| Kurtosis | -0.3988922536 |
| Mean | 2197.588379 |
| Median Absolute Deviation (MAD) | 996.3306274 |
| Skewness | 0.5477777123 |
| Sum | 484034240 |
| Variance | 1686549.625 |
| Value | Count | Frequency (%) | |
| 2668 | 32453 | 14.6% | |
| 2102 | 26892 | 12.1% | |
| 1443 | 24130 | 10.8% | |
| 2206 | 22459 | 10.1% | |
| 585 | 22198 | 10.0% | |
| 525 | 21947 | 9.9% | |
| 4589 | 8562 | 3.8% | |
| 1980 | 7915 | 3.6% | |
| 4730 | 7243 | 3.3% | |
| 4142 | 4624 | 2.1% | |
| Other values (999) | 41834 | 18.8% |
| Value | Count | Frequency (%) | |
| 74 | 3 | < 0.1% | |
| 82 | 2 | < 0.1% | |
| 86 | 5 | < 0.1% | |
| 165 | 3 | < 0.1% | |
| 176 | 16 | < 0.1% |
| Value | Count | Frequency (%) | |
| 6143 | 1 | < 0.1% | |
| 6142 | 1 | < 0.1% | |
| 6095 | 1 | < 0.1% | |
| 6086 | 2 | < 0.1% | |
| 6062 | 12 | < 0.1% |
| Distinct count | 24613 |
|---|---|
| Unique (%) | 11.2% |
| Missing | 2401 |
| Missing (%) | 1.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 238712.25 |
|---|---|
| Minimum | 23.0 |
| Maximum | 345496.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 869.3 KiB |
Quantile statistics
| Minimum | 23 |
|---|---|
| 5-th percentile | 83257 |
| Q1 | 189586 |
| median | 245971 |
| Q3 | 303366 |
| 95-th percentile | 331210.0938 |
| Maximum | 345496 |
| Range | 345473 |
| Interquartile range (IQR) | 113780 |
Descriptive statistics
| Standard deviation | 72003.53125 |
|---|---|
| Coefficient of variation (CV) | 0.3016331494 |
| Kurtosis | 1.158289671 |
| Mean | 238712.25 |
| Median Absolute Deviation (MAD) | 54601.05078 |
| Skewness | -0.9952222109 |
| Sum | 5.254032589e+10 |
| Variance | 5184508416 |
| Value | Count | Frequency (%) | |
| 313586 | 8448 | 3.8% | |
| 242491 | 6770 | 3.0% | |
| 317701 | 3736 | 1.7% | |
| 317708 | 3087 | 1.4% | |
| 188345 | 2031 | 0.9% | |
| 245824 | 2012 | 0.9% | |
| 228975 | 2004 | 0.9% | |
| 241876 | 1793 | 0.8% | |
| 244755 | 1349 | 0.6% | |
| 248045 | 1047 | 0.5% | |
| Other values (24603) | 187822 | 84.4% | |
| (Missing) | 2401 | 1.1% |
| Value | Count | Frequency (%) | |
| 23 | 2 | < 0.1% | |
| 150 | 4 | < 0.1% | |
| 156 | 6 | < 0.1% | |
| 167 | 1 | < 0.1% | |
| 171 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 345496 | 1 | < 0.1% | |
| 345490 | 1 | < 0.1% | |
| 345485 | 1 | < 0.1% | |
| 345433 | 1 | < 0.1% | |
| 345410 | 1 | < 0.1% |
Census_ProcessorCoreCount
Real number (ℝ≥0)
| Distinct count | 20 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 1094 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | nan |
|---|---|
| Minimum | 1.0 |
| Maximum | 64.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 434.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 8 |
| Maximum | 64 |
| Range | 63 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0 |
|---|---|
| Coefficient of variation (CV) | nan |
| Kurtosis | nan |
| Mean | nan |
| Median Absolute Deviation (MAD) | nan |
| Skewness | nan |
| Sum | inf |
| Variance | 0 |
| Value | Count | Frequency (%) | |
| 4 | 137649 | 61.9% | |
| 2 | 52982 | 23.8% | |
| 8 | 24015 | 10.8% | |
| 12 | 2741 | 1.2% | |
| 6 | 1968 | 0.9% | |
| 1 | 1032 | 0.5% | |
| 16 | 521 | 0.2% | |
| 3 | 308 | 0.1% | |
| 32 | 58 | < 0.1% | |
| 24 | 50 | < 0.1% | |
| Other values (10) | 82 | < 0.1% | |
| (Missing) | 1094 | 0.5% |
| Value | Count | Frequency (%) | |
| 1 | 1032 | 0.5% | |
| 2 | 52982 | 23.8% | |
| 3 | 308 | 0.1% | |
| 4 | 137649 | 61.9% | |
| 5 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 64 | 3 | < 0.1% | |
| 56 | 4 | < 0.1% | |
| 48 | 4 | < 0.1% | |
| 40 | 12 | < 0.1% | |
| 36 | 6 | < 0.1% |
Census_ProcessorManufacturerIdentifier
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 1095 |
| Missing (%) | 0.5% |
| Memory size | 434.7 KiB |
| 5 | |
|---|---|
| 1 | 25296 |
| 3 | 2 |
| Value | Count | Frequency (%) | |
| 5 | 196107 | 88.1% | |
| 1 | 25296 | 11.4% | |
| 3 | 2 | < 0.1% | |
| (Missing) | 1095 | 0.5% |
Length
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 4 | 57.1% | |
| Lowercase_Letter | 2 | 28.6% | |
| Other_Punctuation | 1 | 14.3% |
| Value | Count | Frequency (%) | |
| Common | 5 | 71.4% | |
| Latin | 2 | 28.6% |
| Value | Count | Frequency (%) | |
| ASCII | 7 | 100.0% |
Census_ProcessorModelIdentifier
Real number (ℝ≥0)
| Distinct count | 1887 |
|---|---|
| Unique (%) | 0.9% |
| Missing | 1095 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2389.8523 |
|---|---|
| Minimum | 19.0 |
| Maximum | 4472.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 869.3 KiB |
Quantile statistics
| Minimum | 19 |
|---|---|
| 5-th percentile | 311 |
| Q1 | 2056 |
| median | 2523 |
| Q3 | 2883 |
| 95-th percentile | 3426 |
| Maximum | 4472 |
| Range | 4453 |
| Interquartile range (IQR) | 827 |
Descriptive statistics
| Standard deviation | 828.9053955 |
|---|---|
| Coefficient of variation (CV) | 0.3468437791 |
| Kurtosis | 1.441684008 |
| Mean | 2389.852295 |
| Median Absolute Deviation (MAD) | 580.838562 |
| Skewness | -1.051075339 |
| Sum | 529125248 |
| Variance | 687084.125 |
| Value | Count | Frequency (%) | |
| 2697 | 7829 | 3.5% | |
| 1998 | 6575 | 3.0% | |
| 2660 | 5332 | 2.4% | |
| 2373 | 4639 | 2.1% | |
| 2382 | 4480 | 2.0% | |
| 2640 | 4245 | 1.9% | |
| 1992 | 4101 | 1.8% | |
| 2737 | 3215 | 1.4% | |
| 3063 | 3208 | 1.4% | |
| 1985 | 3184 | 1.4% | |
| Other values (1877) | 174597 | 78.5% |
| Value | Count | Frequency (%) | |
| 19 | 34 | < 0.1% | |
| 23 | 4 | < 0.1% | |
| 25 | 7 | < 0.1% | |
| 27 | 3 | < 0.1% | |
| 29 | 94 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4472 | 1 | < 0.1% | |
| 4469 | 1 | < 0.1% | |
| 4468 | 1 | < 0.1% | |
| 4446 | 1 | < 0.1% | |
| 4437 | 1 | < 0.1% |
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 221498 |
| Missing (%) | 99.5% |
| Memory size | 217.5 KiB |
| mid | |
|---|---|
| low | |
| high |
| Value | Count | Frequency (%) | |
| mid | 566 | 0.3% | |
| low | 228 | 0.1% | |
| high | 208 | 0.1% | |
| (Missing) | 221498 | 99.5% |
Length
| Max length | 4 |
|---|---|
| Mean length | 3.000934831 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 10 | 100.0% |
| Value | Count | Frequency (%) | |
| Latin | 10 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 10 | 100.0% |
Census_PrimaryDiskTotalCapacity
Real number (ℝ≥0)
| Distinct count | 605 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 1379 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 531557.4154377015 |
|---|---|
| Minimum | 0.0 |
| Maximum | 11445248.0 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 59640 |
| Q1 | 244198 |
| median | 476940 |
| Q3 | 953869 |
| 95-th percentile | 953869 |
| Maximum | 11445248 |
| Range | 11445248 |
| Interquartile range (IQR) | 709671 |
Descriptive statistics
| Standard deviation | 357291.7955 |
|---|---|
| Coefficient of variation (CV) | 0.6721603069 |
| Kurtosis | 11.87550453 |
| Mean | 531557.4154 |
| Median Absolute Deviation (MAD) | 279611.7992 |
| Skewness | 1.401335489 |
| Sum | 1.175385073e+11 |
| Variance | 1.276574271e+11 |
| Value | Count | Frequency (%) | |
| 476940 | 70823 | 31.8% | |
| 953869 | 58447 | 26.3% | |
| 122104 | 12382 | 5.6% | |
| 244198 | 11876 | 5.3% | |
| 305245 | 10518 | 4.7% | |
| 238475 | 7541 | 3.4% | |
| 114473 | 7054 | 3.2% | |
| 29820 | 6368 | 2.9% | |
| 715404 | 6196 | 2.8% | |
| 228936 | 4720 | 2.1% | |
| Other values (595) | 25196 | 11.3% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 10227 | 1 | < 0.1% | |
| 14800 | 24 | < 0.1% | |
| 14910 | 2 | < 0.1% | |
| 14912 | 19 | < 0.1% |
| Value | Count | Frequency (%) | |
| 11445248 | 1 | < 0.1% | |
| 11444736 | 1 | < 0.1% | |
| 5723166 | 2 | < 0.1% | |
| 5723091 | 1 | < 0.1% | |
| 5376000 | 1 | < 0.1% |
Census_PrimaryDiskTypeName
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 289 |
| Missing (%) | 0.1% |
| Memory size | 217.6 KiB |
| HDD | |
|---|---|
| SSD | |
| UNKNOWN | 8014 |
| Unspecified | 6042 |
| Value | Count | Frequency (%) | |
| HDD | 146901 | 66.0% | |
| SSD | 61254 | 27.5% | |
| UNKNOWN | 8014 | 3.6% | |
| Unspecified | 6042 | 2.7% | |
| (Missing) | 289 | 0.1% |
Length
| Max length | 11 |
|---|---|
| Mean length | 3.36131236 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 9 | 52.9% | |
| Uppercase_Letter | 8 | 47.1% |
| Value | Count | Frequency (%) | |
| Latin | 17 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 17 | 100.0% |
Census_SystemVolumeTotalCapacity
Real number (ℝ≥0)
| Distinct count | 83754 |
|---|---|
| Unique (%) | 37.9% |
| Missing | 1378 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 382266.9740686137 |
|---|---|
| Minimum | 0.0 |
| Maximum | 5375384.0 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40960 |
| Q1 | 121027 |
| median | 250203 |
| Q3 | 476164 |
| 95-th percentile | 952728 |
| Maximum | 5375384 |
| Range | 5375384 |
| Interquartile range (IQR) | 355137 |
Descriptive statistics
| Standard deviation | 323012.6715 |
|---|---|
| Coefficient of variation (CV) | 0.8449923571 |
| Kurtosis | 4.306829133 |
| Mean | 382266.9741 |
| Median Absolute Deviation (MAD) | 252225.7663 |
| Skewness | 1.557900475 |
| Sum | 8.452763784e+10 |
| Variance | 1.043371859e+11 |
| Value | Count | Frequency (%) | |
| 926992 | 1263 | 0.6% | |
| 953253 | 1119 | 0.5% | |
| 476389 | 1115 | 0.5% | |
| 28542 | 1106 | 0.5% | |
| 476324 | 1042 | 0.5% | |
| 952728 | 961 | 0.4% | |
| 102400 | 945 | 0.4% | |
| 475799 | 864 | 0.4% | |
| 476323 | 841 | 0.4% | |
| 952792 | 835 | 0.4% | |
| Other values (83744) | 211031 | 94.8% | |
| (Missing) | 1378 | 0.6% |
| Value | Count | Frequency (%) | |
| 0 | 2 | < 0.1% | |
| 7667 | 1 | < 0.1% | |
| 9676 | 1 | < 0.1% | |
| 9782 | 1 | < 0.1% | |
| 10000 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5375384 | 1 | < 0.1% | |
| 3906691 | 1 | < 0.1% | |
| 3815430 | 1 | < 0.1% | |
| 3814881 | 1 | < 0.1% | |
| 3814880 | 1 | < 0.1% |
Census_HasOpticalDiskDrive
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| 0 | |
|---|---|
| 1 | 18503 |
| Value | Count | Frequency (%) | |
| 0 | 203997 | 91.7% | |
| 1 | 18503 | 8.3% |
Census_TotalPhysicalRAM
Real number (ℝ≥0)
| Distinct count | 226 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 1831 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6422.0474 |
|---|---|
| Minimum | 768.0 |
| Maximum | 262144.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 869.3 KiB |
Quantile statistics
| Minimum | 768 |
|---|---|
| 5-th percentile | 2048 |
| Q1 | 4096 |
| median | 4096 |
| Q3 | 8192 |
| 95-th percentile | 16384 |
| Maximum | 262144 |
| Range | 261376 |
| Interquartile range (IQR) | 4096 |
Descriptive statistics
| Standard deviation | 5048.120117 |
|---|---|
| Coefficient of variation (CV) | 0.7860608697 |
| Kurtosis | 148.8574677 |
| Mean | 6422.047363 |
| Median Absolute Deviation (MAD) | 3208.66626 |
| Skewness | 6.831326485 |
| Sum | 1417146752 |
| Variance | 25483518 |
| Value | Count | Frequency (%) | |
| 4096 | 101747 | 45.7% | |
| 8192 | 58972 | 26.5% | |
| 2048 | 22297 | 10.0% | |
| 16384 | 15231 | 6.8% | |
| 6144 | 10444 | 4.7% | |
| 12288 | 4489 | 2.0% | |
| 3072 | 3241 | 1.5% | |
| 32768 | 1632 | 0.7% | |
| 1024 | 809 | 0.4% | |
| 24576 | 359 | 0.2% | |
| Other values (216) | 1448 | 0.7% | |
| (Missing) | 1831 | 0.8% |
| Value | Count | Frequency (%) | |
| 768 | 1 | < 0.1% | |
| 991 | 1 | < 0.1% | |
| 1013 | 1 | < 0.1% | |
| 1014 | 1 | < 0.1% | |
| 1015 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 262144 | 2 | < 0.1% | |
| 196608 | 2 | < 0.1% | |
| 131072 | 28 | < 0.1% | |
| 98304 | 3 | < 0.1% | |
| 90112 | 1 | < 0.1% |
Census_ChassisTypeName
Categorical
| Distinct count | 35 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 15 |
| Missing (%) | < 0.1% |
| Memory size | 218.9 KiB |
| Notebook | |
|---|---|
| Desktop | |
| Laptop | 16821 |
| Portable | 8758 |
| AllinOne | 5185 |
| Other values (30) | 11483 |
| Value | Count | Frequency (%) | |
| Notebook | 131600 | 59.1% | |
| Desktop | 48638 | 21.9% | |
| Laptop | 16821 | 7.6% | |
| Portable | 8758 | 3.9% | |
| AllinOne | 5185 | 2.3% | |
| MiniTower | 2217 | 1.0% | |
| Convertible | 2030 | 0.9% | |
| UNKNOWN | 1530 | 0.7% | |
| LowProfileDesktop | 1278 | 0.6% | |
| Other | 985 | 0.4% | |
| Other values (25) | 3443 | 1.5% |
Length
| Max length | 19 |
|---|---|
| Mean length | 7.717640449 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 21 | 50.0% | |
| Uppercase_Letter | 17 | 40.5% | |
| Decimal_Number | 4 | 9.5% |
| Value | Count | Frequency (%) | |
| Latin | 38 | 90.5% | |
| Common | 4 | 9.5% |
| Value | Count | Frequency (%) | |
| ASCII | 42 | 100.0% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
| df_index | Census_MDC2FormFactor | Census_DeviceFamily | Census_OEMNameIdentifier | Census_OEMModelIdentifier | Census_ProcessorCoreCount | Census_ProcessorManufacturerIdentifier | Census_ProcessorModelIdentifier | Census_ProcessorClass | Census_PrimaryDiskTotalCapacity | Census_PrimaryDiskTypeName | Census_SystemVolumeTotalCapacity | Census_HasOpticalDiskDrive | Census_TotalPhysicalRAM | Census_ChassisTypeName | HasDetections | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3 | Notebook | Windows.Desktop | 1443.0 | 256685.0 | 4.0 | 5.0 | 3063.0 | NaN | 953869.0 | HDD | 149500.0 | 0 | 8192.0 | Notebook | 1 |
| 1 | 5 | Notebook | Windows.Desktop | 4894.0 | 272191.0 | 2.0 | 5.0 | 2097.0 | NaN | 953869.0 | HDD | 953160.0 | 0 | 4096.0 | Notebook | 1 |
| 2 | 6 | Desktop | Windows.Desktop | 1443.0 | 275893.0 | 8.0 | 5.0 | 2962.0 | NaN | 244198.0 | SSD | 243645.0 | 0 | 16384.0 | MiniTower | 1 |
| 3 | 9 | Notebook | Windows.Desktop | 585.0 | 189551.0 | 2.0 | 5.0 | 2097.0 | NaN | 476940.0 | HDD | 475798.0 | 1 | 4096.0 | Notebook | 1 |
| 4 | 10 | Desktop | Windows.Desktop | 924.0 | 193916.0 | 2.0 | 5.0 | 3195.0 | NaN | 476940.0 | HDD | 198572.0 | 0 | 2048.0 | Desktop | 1 |
| 5 | 11 | Notebook | Windows.Desktop | 525.0 | 331192.0 | 4.0 | 5.0 | 2321.0 | NaN | 476940.0 | HDD | 100000.0 | 0 | 2048.0 | Notebook | 1 |
| 6 | 12 | Notebook | Windows.Desktop | 2206.0 | 229872.0 | 2.0 | 5.0 | 1983.0 | NaN | 476940.0 | HDD | 461312.0 | 0 | 2048.0 | Notebook | 1 |
| 7 | 15 | AllInOne | Windows.Desktop | 3035.0 | 263666.0 | 4.0 | 5.0 | 2407.0 | NaN | 228936.0 | SSD | 228320.0 | 0 | 4096.0 | Desktop | 1 |
| 8 | 17 | Desktop | Windows.Desktop | 3035.0 | 263637.0 | 8.0 | 5.0 | 2966.0 | NaN | 1907729.0 | HDD | 1906339.0 | 0 | 16384.0 | Desktop | 1 |
| 9 | 18 | Notebook | Windows.Desktop | 4730.0 | 310837.0 | 4.0 | 5.0 | 2296.0 | NaN | 715404.0 | HDD | 704278.0 | 0 | 6144.0 | Notebook | 1 |
Last rows
| df_index | Census_MDC2FormFactor | Census_DeviceFamily | Census_OEMNameIdentifier | Census_OEMModelIdentifier | Census_ProcessorCoreCount | Census_ProcessorManufacturerIdentifier | Census_ProcessorModelIdentifier | Census_ProcessorClass | Census_PrimaryDiskTotalCapacity | Census_PrimaryDiskTypeName | Census_SystemVolumeTotalCapacity | Census_HasOpticalDiskDrive | Census_TotalPhysicalRAM | Census_ChassisTypeName | HasDetections | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 222490 | 446217 | Notebook | Windows.Desktop | 2102.0 | 229920.0 | 2.0 | 5.0 | 1998.0 | NaN | 476940.0 | HDD | 461953.0 | 0 | 4096.0 | Notebook | 1 |
| 222491 | 446218 | Notebook | Windows.Desktop | 525.0 | 331178.0 | 8.0 | 5.0 | 2737.0 | NaN | 953869.0 | HDD | 952792.0 | 0 | 8192.0 | Notebook | 1 |
| 222492 | 446219 | Notebook | Windows.Desktop | 585.0 | 189211.0 | 4.0 | 5.0 | 3499.0 | NaN | 122104.0 | SSD | 121488.0 | 0 | 4096.0 | Notebook | 1 |
| 222493 | 446221 | Notebook | Windows.Desktop | 585.0 | 189592.0 | 4.0 | 5.0 | 3063.0 | NaN | 122104.0 | SSD | 122102.0 | 0 | 8192.0 | Notebook | 1 |
| 222494 | 446223 | Notebook | Windows.Desktop | 2206.0 | 244755.0 | 4.0 | 1.0 | 187.0 | NaN | 476940.0 | HDD | 455604.0 | 0 | 4096.0 | Notebook | 1 |
| 222495 | 446224 | Notebook | Windows.Desktop | 525.0 | 331054.0 | 2.0 | 5.0 | 2097.0 | NaN | 476940.0 | HDD | 475863.0 | 0 | 4096.0 | Notebook | 1 |
| 222496 | 446231 | Notebook | Windows.Desktop | 525.0 | 331260.0 | 2.0 | 5.0 | 2012.0 | NaN | 476940.0 | HDD | 454536.0 | 1 | 4096.0 | Notebook | 1 |
| 222497 | 446235 | Desktop | Windows.Desktop | 3150.0 | 328141.0 | 8.0 | 5.0 | 2891.0 | NaN | 102399.0 | SSD | 101898.0 | 0 | 12288.0 | Desktop | 1 |
| 222498 | 446241 | AllInOne | Windows.Desktop | 666.0 | 340349.0 | 4.0 | 5.0 | 2510.0 | NaN | 953869.0 | HDD | 204800.0 | 0 | 16384.0 | AllinOne | 1 |
| 222499 | 446244 | Desktop | Windows.Desktop | 1443.0 | 275846.0 | 2.0 | 5.0 | 3212.0 | NaN | 238418.0 | UNKNOWN | 99147.0 | 0 | 3072.0 | MiniTower | 1 |